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METHOD AND APPARATUS FOR COALESCING 



INPUT OUTPUT ACCESSES TO A VIRTUAL DEVICE 



FIELD 

5 An embodiment of the present invention relates to virtualization. More specifically, an 

embodiment of the present invention relates to a method and apparatus for coalescing input and 
output (10) accesses to a virtual device. 

BACKGROUND 

1 0 Recently industry trends, such as server consolidation and the proliferation of 

inexpensive shared-memory multiprocessors, have fueled a resurgence of interest in server 
virtualization techniques. Virtual machines are particularly attractive for server virtualization. 
Each virtual machine is given the illusion of executing on a dedicated physical machine that is 
fully protected and isolated from other virtual machines. Virtual machines can also be convenient 

1 5 abstractions of server workloads, because they can cleanly encapsulate the entire state of a 
running system, including both user-level applications and kernel mode operating system 
services. In many computing environments, individual servers are underutilized, allowing them 
to be consolidated as virtual machines on a single physical server with little or no performance 
penalty. Similarly, many small servers can be consolidated onto fewer larger machines to 

20 simplify management and reduce costs. 

When an application on a virtual machine requires access to an input output device via 
repeated access to one or more control registers, the virtualization event causes inefficiencies 
which require emulation of each access before resuming execution of the application. The 
virtualization events are extremely expensive compared to other operations, costing more than 

25 twice that of hardware interrupts. 
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One known technique used to address the issues associated with a virtualization event 
involves creating special device drivers for commonly used devices. These special device drivers 
would be written to minimize the number of exits from a virtual machine and the number of 
general protection faults generated by the system. Although this technique was effective in some 
5 instances, these special device drivers were not always available for all devices or all operating 
systems that may wish to access the devices. 

Thus, what is needed is an effective and efficient method and apparatus for managing 
input output accesses to a virtual device. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The features and advantages of the present invention are illustrated by way of example 
and are by no means intended to limit the scope of the present invention to the particular 
embodiments shown, and in which: 
5 Figure 1 is a block diagram that illustrates components of a system in which an 

embodiment of the invention resides; 

Figure 2 illustrates an embodiment of the physical machine in the system according to an 
embodiment of the present invention; 

Figure 3 is a block diagram that illustrates sub-components residing in the components of 
10 the system according to an embodiment of the invention; 

Figure 4 illustrates components of a virtual machine monitor according to an embodiment 
of the present invention; 

Figure 5 is a flow chart of a method for performing input output accesses on a virtual 
device according to an embodiment of the present invention; 
1 5 Figure 6 is a flow chart of a method for performing input output accesses on a virtual 

device according to a second embodiment of the present invention; 

Figure 7 is a flow chart of a method for performing input output accesses on a virtual 
device according to a third embodiment of the present invention; and 

Figure 8 is an exemplary instruction stream that is processed may by a virtual machine 
20 monitor according to an embodiment of the present invention. 
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DETAILED DESCRIPTION 

In the following description, for purposes of explanation, specific nomenclature is set 
forth to provide a thorough understanding of embodiments of the present invention. However, it 
will be apparent to one skilled in the art that these specific details may not be required to practice 
5 the embodiments of the present invention. In other instances, well-known circuits, devices, and 
programs are shown in block diagram form to avoid obscuring embodiments of the present 
invention unnecessarily. 

Figure 1 is a block diagram that illustrates components of a system 100 in which an 
embodiment of the invention resides. The system includes a physical machine 110: According to 
10 one embodiment, the physical machine 1 10 may be components of a computer system (not 
shown). The computer system may include, for example, a processor, a memory, buses, and 
various input output (IO) devices (not shown). 

Figure 2 is a block diagram of an exemplary computer system 200 according to an 
embodiment of the present invention. The computer system 200 may be used to implement the 
15 physical machine 1 10 shown in Figure 1. The computer system 200 includes a processor 201 that 
processes data signals. The processor 201 may be a complex instruction set computer 
microprocessor, a reduced instruction set computing microprocessor, a very long instruction word 
microprocessor, a processor implementing a combination of instruction sets, or other processor 
device. Figure 2 shows the computer system 200 with a single processor. However, it is 
20 understood that the computer system 200 may operate with multiple processors. The processor 
201 is coupled to a CPU bus 210 that transmits data signals between processor 201 and other 
components in the computer system 200. 

The computer system 200 includes a memory 213. The memory 213 may be a dynamic 
random access memory device, a static random access memory device, or other memory device. 
25 The memory 213 may store instructions and code represented by data signals that may be 

executed by the processor 201. A cache memory 202 resides inside processor 201 that stores data 
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signals stored in memory 213. The cache 202 speeds up memory accesses by the processor 201 
by taking advantage of its locality of access. In an alternate embodiment of the computer system 
200, the cache 202 resides external to the processor 201. A bridge memory controller 21 1 is 
coupled to the CPU bus 210 and the memory 213. The bridge memory controller 21 1 directs data 
5 signals between the processor 20 1 , the memory 213, and other components in the computer 
system 200 and bridges the data signals between the CPU bus 210, the memory 213, and a first 
IO bus 220. 

The first IO bus 220 may be a single bus or a combination of multiple buses. The first IO 
bus 220 provides communication links between components in the computer system 200. A 

1 0 network controller 22 1 is coupled to the first IO bus 220. The network controller 22 1 may link 
the computer system 200 to a network of computers (not shown) and supports communication 
among the machines. A display device controller 222 is coupled to the first IO bus 220. The 
display device controller 222 allows coupling of a display device (not shown) to the computer 
system 200 and acts as an interface between the display device and the computer system 100. 

15 A second IO bus 230 may be a single bus or a combination of multiple buses. The 

second IO bus 230 provides communication links between components in the computer system 
200. A data storage device 23 1 is coupled to the second IO bus 230. The data storage device 23 1 
may be a hard disk drive, a floppy disk drive, a CD-ROM device, a flash memory device or other 
mass storage device. An input interface 232 is coupled to the second IO bus 230. The input 

20 interface 232 may be, for example, a keyboard and/or mouse controller or other input interface. 
The input interface 232 may be a dedicated device or can reside in another device such as a bus 
controller or other controller. The input interface 232 allows coupling of an input device to the 
computer system 200 and transmits data signals from an input device to the computer system 200. 
An audio controller 233 is coupled to the second IO bus 230. The audio controller 233 operates 

25 to coordinate the recording and playing of sounds and is also coupled to the IO bus 230. A bus 
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bridge 223 couples the first IO bus 220 to the second IO bus 230. The bus bridge 223 operates to 
buffer and bridge data signals between the first IO bus 220 and the second IO bus 230. 

Referring back to Figure 1, the system 100 includes a virtual machine monitor (VMM) 
120. The VMM 120 is a layer that interfaces the physical machine 110 and that facilitates a 
5 plurality of virtual machines (VMs) 130 to be run. The virtual machine monitor 120 manages and 
mediates computer system resources in the physical machine 110 between the virtual machines 
and allows the isolation of or data sharing between virtual machines. The virtual machine 
monitor 120 achieves this by virtualizing resources in the physical machine 110 and exporting a 
virtual hardware interface that could reflect an underlying architecture of the physical machine 

10 1 10, a variant of the physical machine, or an entirely different physical machine. 

The system 100 also includes one or more virtual machines 131-134 (collectively shown 
as 130). According to an embodiment of the present invention, a virtual machine may be 
described as an isolated virtual replica of a machine including, but not limited to, a replica of the 
physical machine, a subset of the physical machine, or replica of an entirely different machine. 

15 The virtual machine may include the resources of the computer system in the physical machine 
1 10, a subset of the resources of the computer system in the physical machine 1 10, or entirely 
virtual resources not found in the physical machine. 

According to an embodiment of the present invention, the virtual machine monitor 120 
takes complete control of the physical machine 1 10 and creates virtual machines 130, each of 

20 which behaves like a complete physical machine that can run its own operating system (OS). 
Virtual machines 13 1-134 may run operating systems 141-144 respectively where the operating 
systems 141-144 may be unique to one another. To maximize performance, the virtual machine 
monitor 120 allows a virtual machine to execute directly on the resources of the computer system 
in the physical machine 1 10 when possible. The virtual machine monitor 120 takes control, 

25 however, whenever a virtual machine attempts to perform an operation that may affect the 

operation of other virtual machines or of the operation of resources in the physical machine 110. 



The virtual machine monitor 120 emulates the operation and returns control to the virtual 
machine when the operation is completed. 

In virtualizing 10 devices, the virtual machine manager 120 intercepts IO operations 
issued by an operating system on a virtual machine. The 10 operations may be, for example, EN 
5 and OUT instructions or memory mapped 10 instructions. The 10 instructions directing IO 
operations are trapped by the virtual machine manager 120 and are emulated by the virtual 
machine manager 120. IO instructions typically include addresses that are not valid outside the 
virtual machine. Emulation of these instructions becomes necessary because the instructions 
cannot be executed natively on the resources of the physical machine 110. An IO operation is 

10 considered a virtualization event (VM exit) since it requires its corresponding IO instruction to be 
emulated by the virtual machine manager 120. A virtualization event requires storing the state of 
the processor for a current virtual machine and reloading this state when the virtualization event 
is complete. This may require several thousand processor clock cycles which is costly. 
According to an embodiment of the present invention, the virtual machine manager 120 coalesces 

15 a plurality of IO instructions in an instruction stream and emulates them during a single 
virtualization event in order to reduce the overall cost of executing an instruction stream. 

Figure 3 is a block diagram that illustrates the sub-components residing in the 
components of the system according to an embodiment of the invention. The system 300 
includes components similar to the components in the system 100 illustrated in Figure 1. A first 

20 application 3 1 1 running in an application space of a first virtual machine 13 1 may include 

instructions in its instruction stream to access physical device 340 in the physical machine 110. 
A second application 312 running in an application space of a second virtual machine 132 may 
include instructions in its instruction stream to access physical device 340 in the physical machine 
1 10. A device driver 321 running in a kernel space of a first operating system 141 of the first 

25 virtual machine 131 communicates with a virtual device 33 1 in the virtual machine monitor 120 
via a virtualization event dispatcher (VED) 332. A device driver 322 running in a kernel space of 



a second operating system 142 of the second virtual machine 132 communicates with the virtual 
device 33 1 in the virtual machine monitor via the visualization event dispatcher 332. The virtual 
device 331 virtualizes the functionalities of the physical device 340. The virtual device 330 also 
facilitates correct emulation of the physical device 340 to the device drivers 321 and 322 utilizing 
5 emulator 333 and coordinates access to the physical device 340 when necessary. 

Figure 4 illustrates a virtualization event dispatcher 400 according to an embodiment of 
the present invention. The virtualization event dispatcher 400 may be used to implement the 
virtualization event dispatcher 332 shown in Figure 3. The virtualization event dispatcher 400 
includes an instruction interpreter unit 410. The instruction interpreter unit 410 receives 

10 instructions from device drivers in virtual machines. The instruction interpreter unit 410 

determines whether an instruction received that causes a virtualization event is an IO instruction 
directing an IO operation. According to an embodiment of the present invention, the instruction 
interpreter unit 410 makes this determination by identifying whether the instruction received is an 
IN and OUT instruction or a memory mapped IO instruction. 

15 The virtualization event dispatcher 400 includes an instruction scanning unit 420. In 

response to the instruction interpreter unit 410 determining that an instruction received is an IO 
instruction, the instruction scanning unit 420 accesses the instruction stream originating the 
received IO instruction. According to an embodiment of the virtualization event dispatcher 400, 
the instruction scanning unit 420 interfaces with the virtual machine running the instruction 

20 stream in order to access the instruction stream. The instruction scanning unit 420 scans the 

instruction stream to determine whether additional IO instructions are present within an extent of 
instructions in the instruction stream. 

In one embodiment, the extent of instructions used for determining whether additional IO 
instructions are present is determined based on the type of hardware components in the physical 

25 machine 1 10 (shown in Figure 1). The type of hardware components may include, for example, 
the type of processor in the physical machine 110. In this example, the more powerful the 



processor, the greater number of lines may be designated for the extent. The extent of 
instructions used for determining whether additional 10 instructions are present may also be 
determined by the type of software on the virtual machine and/or other criteria. 

If the instruction scanning unit 420 determines that additional 10 instructions are present 
5 within the extent of instructions, the additional 10 instructions along with any intermediate non- 
10 instructions in the instruction stream are grouped together and identified with the original 
received 10 instruction as a block of instructions. The block of instructions is directed to be 
emulated in the virtual machine monitor during the virtualization event. 

The virtualization event dispatcher 400 includes an instruction pointer update unit 430. 

10 The instruction pointer update unit 430 interfaces with the instruction pointers in the virtual 
machines of a system. When instructions in an instruction stream have been emulated by a 
virtual machine monitor, the instruction pointer update unit 430 updates the instruction pointer of 
the virtual machine running the instruction stream to move past the instructions that were 
emulated. Upon completion of the virtualization event, the instruction pointer of the virtual 

15 machine is directed to accurately point to the appropriate instruction that should be executed 
when control is returned to the virtual machine. 

The virtualization event dispatcher 400 includes a hashing unit 440. According to an 
embodiment of the present invention, the hashing unit 440 performs a hash function on a block of 
instructions identified by the instruction scanning unit 420 and the address of the originally 

20 received IO instruction. According to one embodiment, the hashing unit 440 stores the computed 
hash value with the received IO address and a value indicating the size of the block of 
instructions in a table (not shown). The table may reside in a virtual device corresponding to a 
physical device with which the IO instruction is directed to. In this embodiment, the hashing unit 
440 searches for addresses of IO instructions identified by the instruction interpreter unit 410 in 

25 the table. If a matching address is found, a block of instructions from the instruction stream is 
identified using the information corresponding to the matching address in the table. The hashing 
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unit 440 performs a hash function on the block of instructions and the address of the IO 
instruction identified by the instruction interpreter unit 410. If the computed hash value matches 
the hash value corresponding to the matching address on the table, the block of instructions is 
emulated. 

5 According to an alternate embodiment of the present invention, the hashing unit 440 

performs a hash function on the block of instructions identified by the instruction scanning unit 
420. The hashing unit 440 stores the hashed value of the block of instruction into a table. 
According to an embodiment of the virtualization event dispatcher 400, the block of instructions 
identified by the instruction scanning unit 420 is emulated by the virtual machine monitor only 

10 upon determining that the block had been previously identified by the instruction scanning unit 
420. In this embodiment, the hashing unit 440 compares a hashed value of a block of instructions 
with other hashed values in the table. If the hashed value of the block matches a hashed value in 
the table, the hashing unit 440 determines that the block of instructions had been previously 
identified by the instruction scanning unit. If the hashed value of the block does not match the 

15 hashed values in the table, the hashing unit 440 stores the hashed value in the table and only the 
received IO instruction, not the entire block of instructions, is emulated. 

According to another embodiment of the present invention, the hashing unit 440 performs 
a hash function on the address of the IO instruction received by the instruction identifier 410. In 
this embodiment, the address is a physical address of the IO instruction in a memory in the 

20 physical machine. In response to the instruction scanning unit 420 determining that additional IO 
instructions are present within an extent of the received IO instruction in instructions stream, the 
hashing unit 440 writes the hashed address of the received IO instruction in the table. The 
hashing unit 440 also writes information regarding the size or length of the block of instructions 
as determined by the instruction scanning unit 420. In this embodiment, the hashing unit 440 

25 compares the hashed value of an address of a received IO instruction with hashed values in the 
table. If the hashed value of the address matches a hashed value in the table, the hashing unit 440 
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directs the block of instructions recorded in the table to be emulated by the virtual machine 
monitor without requiring the instruction scanning unit 420 to scan an instruction stream 
corresponding to the received IO instruction. 

Figure 5 is a flow chart of a method for performing input output accesses on a virtual 
5 device according to an embodiment of the present invention. At 501, normal execution of an 
instruction stream of an application on a virtual machine is performed. 

At 502, upon encountering a virtualization event, it is determined whether the 
virtual izat ion event is caused by an IO operation. According to an embodiment of the present 
invention, determining whether the virtualization event is caused by an IO operation is achieved 
10 by determining whether the instruction causing the virtualization event is an IN or OUT 

instruction or a memory mapped IO instruction. If the virtualization event is not caused by an IO 
operation, control returns to 501. If the virtualization event is caused by the IO operation, control 
proceeds to 503. 

At 503, it is determined whether the address of the IO instruction for the IO operation is 
15 referenced in a table. According to an embodiment of the present invention, the table identifies 
blocks of instructions that may be emulated during a single virtualization event. The blocks of 
instructions are identified by an address of an IO instruction starting the block, a hash value of the 
instructions in the block and the address of the IO instruction, and a value indicating the size of 
the block. If the address of the IO instruction for the IO operation is stored in the table, control 
20 proceeds to 5 1 1 . If the address of the IO instruction is not stored on the table, control proceeds to 
504. 

At 504, the instruction stream is scanned. According to an embodiment of the present 
invention, the instruction stream is scanned for IO instructions within an extent of instructions 
from the IO instruction triggering the virtualization event. The extent may be determined based 
25 upon criteria such as the type of hardware components on the system and/or the type of software 
being run on the system. If other IO instructions exist within the extent of instructions from the 
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IO instruction triggering the virtualization event, these 10 instructions together with any non-IO 
instructions between the 10 instructions are grouped together and given a designation of a block 
of instructions. 

At 505, it is determined whether the 10 operation is a singular access. If the IO operation 
5 is a single 10 operation within an extent of instructions from the 10 instruction triggering the 

virtualization event, control proceeds to 506. If the 10 operation is not a single 10 operation with 
an extent of instructions from the IO instruction triggering the virtualization event, control 
proceeds to 507. 

At 506, the IO instruction triggering the virtualization event is emulated. Control 
10 proceeds to 501. 

At 507, a hash function is performed on the block of instructions determined by 504. 
According to one embodiment, a hash function is performed on the block of instructions and the 
address of the IO instruction triggering the virtualization event. 

At 508, the address of the IO instruction, the hashed value computed at 507, and a value 
1 5 indicating the size of the block of instructions determined at 504 are stored in the table. 

At 509, all of the instructions in the block of instructions are emulated. According to an 
embodiment of the present invention, a plurality of IO instructions is emulated during a single 
virtualization event. 

At 510, an instruction pointer of the virtual machine executing the instruction stream is 
20 updated to indicate that the instructions in the block of instructions have been executed. Control 
proceeds to 501. 

At 5 1 1 , a hash function is performed on a block of instructions and the address of the IO 
instruction received. The block of instructions is determined from a value indicating a size of the 
block in the table. 
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At 5 12, it is determined whether the hash value computed at 5 1 1 matches a hash value 
corresponding to the IO address stored on the table. If the hash value matches, control proceeds 
to 509. If the hash value does not match, control proceeds to 513. 

At 513, it is determined whether the IO address is referenced at another location in the 
5 table. If the IO address is referenced at another location in the table, control proceeds to 5 1 1 . If 
the IO address is not referenced at another location in the table, control proceeds to 506. 

It should be appreciated that once a singular access is determined at 505, that the singular 
access address may be stored on the table. In this embodiment, when an address match is 
determined at 503, it is determined whether the address is associated with a singular access. If it 
10 is determined that the address is associated with a singular access, the singular access IO 

instruction is emulated and control proceeds to normal execution 501 . If it is determined that the 
address is not associated with a singular access, control proceeds to 5 1 1 . This allows the number 
of instruction stream scans to be reduced. 

It should be further appreciated that instead of computing and comparing a hash value for 
1 5 the block of instructions and the address of the received IO instruction (as shown in 507, 5 1 1 , and 
5 12), a hash value for the block of instructions alone may be computed and compared. 

Figure 6 is a flow chart of a method for performing input output accesses on a virtual 
device according to a second embodiment of the present invention. At 601, normal execution of 
an instruction stream in an application on a virtual machine is performed. 
20 At 602, upon encountering a virtualization event, it is determined whether the 

virtualization event is caused by an IO operation. According to an embodiment of the present 
invention, determining whether the virtualization event is caused by an IO operation is achieved 
by determining whether the instruction causing the virtualization event is an IN or OUT 
instruction or a memory mapped IO instruction. If the virtualization event is not caused by an IO 
25 operation, control returns to 601 . If the virtualization event is caused by the IO operation, control 
proceeds to 603. 
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At 603, the instruction stream is scanned. According to an embodiment of the present 
invention, the instruction stream is scanned for IO instructions within an extent of instructions 
from the IO instruction triggering the virtualization event. The extent may be determined based 
upon criteria such as the type of hardware components on the system and/or the type of software 
5 being run on the system. If other IO instructions exist within the extent of instructions from the 
IO instruction triggering the virtualization event, these IO instructions together with any non-IO 
instructions between the IO instructions are grouped together and given a designation of a block 
of instructions. 

At 604, it is determined whether the IO operation is a singular access. If the IO operation 
10 is a single IO operation within an extent of instructions from the IO instruction triggering the 

virtualization event, control proceeds to 605. If the IO operation is not a single IO operation with 
an extent of instructions from the IO instruction triggering the virtualization event, control 
proceeds to 606. 

At 605, the IO instruction triggering the virtualization event is emulated. Control 
15 proceeds to 601. 

At 606, a hash function is performed on the block of instructions determined from 604 to 
generate a hash value. 

At 607, the hash value is compared with other hash values in a table. If the hash value 
does not match another hash value in the table, control proceeds to 608. If the hash value 
20 matches another hash value in the table, control proceeds to 609. 

At 608, the hash value is stored in the table. Control proceeds to 605. 

At 609, all of the instructions in the block of instructions are emulated. According to an 
embodiment of the present invention, a plurality of IO instructions is emulated during a single 
virtualization event. Control proceeds to 610. 
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At 610, an instruction pointer of the virtual machine executing the instruction stream is 
updated to indicate that the instructions in the block of instructions have been executed. Control 
proceeds to 601. 

Figure 7 is a flow chart of a method for performing input output accesses on a virtual 
5 device according to a third embodiment of the present invention. At 701, normal execution of an 
instruction stream of an application on a virtual machine is performed. 

At 702, upon encountering a virtualization event, it is determined whether the 
virtualization event is caused by an IO operation. According to an embodiment of the present 
invention, determining whether the virtualization event is caused by an IO operation is achieved 
10 by determining whether the instruction causing the virtualization event is an EN or OUT 

instruction or a memory mapped IO instruction. If the virtualization event is not caused by an IO 
operation, control returns to 701. If the virtualization event is caused by the IO operation, control 
proceeds to 703. 

At 703, a hash function is performed on the physical address of the IO instruction to 
1 5 generate a hashed address value. According to an embodiment of the present invention, the 
physical address is an address of the instruction in the memory in the physical machine. 

At 704, a determination is made as to whether the hashed address value matches hash 
values in a table. If a determination is made that the hashed address value matches a hash value 
in the table, control proceeds to 705. If a determination is made that the hashed address value 
20 does not match a hash value in the table, control proceeds to 707. 

At 705, a block of instructions associated with the matching hash value in the table is 
emulated. 

At 706, an instruction pointer of the virtual machine executing the instruction stream is 
updated to indicate that the instructions in the block of instructions have been executed. Control 
25 proceeds to 701. 
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At 707, the instruction stream is scanned. According to an embodiment of the present 
invention, the instruction stream is scanned for IO instructions within an extent of instructions 
from the IO instruction triggering the virilization event. The extent may be determined based 
upon criteria such as the type of hardware components on the system and/or the type of software 
5 being run on the system. If other IO instructions exist within the extent of instructions from the 
IO instruction triggering the virtualization event, these IO instructions together with any non-IO 
instructions between the IO instructions are grouped together and given a designation of a block 
of instructions. 

At 708, it is determined whether the IO operation is a singular access. If the IO operation 
10 is a single IO operation within an extent of instructions from the IO instruction triggering the 

virtualization event, control proceeds to 709. If the IO operation is not a single IO operation with 
an extent of instructions from the IO instruction triggering the virtualization event, control 
proceeds to 710. 

At 709, the IO instruction triggering the virtualization event is emulated. Control 
15 proceeds to 701. 

At 710, the hashed address value and data regarding the block of instruction determined 
by 707 is stored in the table. 

At 71 1, all of the instructions in the block of instructions are emulated. According to an 
embodiment of the present invention, a plurality of IO instructions is emulated during a single 
20 virtualization event. 

At 712, an instruction pointer of the virtual machine executing the instruction stream is 
updated to indicate that the instructions in the block of instructions have been executed. Control 
proceeds to 701. 

Figures 5-7 are flow charts illustrating methods for performing input output accesses on a 
25 virtual device according to embodiments of the present invention. Some of the techniques 

illustrated in these figures may be performed sequentially, in parallel or in an order other than that 
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which is described. It should be appreciated that not all of the techniques described are required 
to be performed, that additional techniques may be added, and that some of the illustrated 
techniques may be substituted with other techniques. 

Figure 8 is a portion of an exemplary instruction stream 800 that is processed by a virtual 
5 machine according to an embodiment of the present invention. Instructions (801)-(815) represent 
various instructions and for the purposes simplification are labeled instructions 1-15, respectively. 
Instruction 3 (803) is the first IO instruction in the instruction stream 800. When instruction 3 
(803) is executed by the virtual machine, it triggers a virtualization event. According to an 
embodiment of the present invention, when the instruction stream 800 is scanned to determine 

10 whether additional IO instructions are within an extent of instructions of instruction 3 (803), an 
extent of ten instructions is used. Instructions 5 (805), instruction 8 (808), and instruction 1 1 
(81 1) are identified as being other IO instructions existing within the extent. In this example, 
instructions 3 through 1 1 (803)-(81 1) are grouped together as block of instructions (850). This 
block of instructions (850) may be emulated together during the virtualization event brought by 

15 instruction (803). Although executing instructions 4 (804), instructions 6-7 (806)-(807), and 
instructions 9-10 (809)-(810) in a virtual machine monitor context using an emulator is less 
efficient than executing the instructions natively in the virtual machine, the overall number of 
clock cycles required for executing the instruction stream 800 may be less by reducing the 
number of virtualization events required for each of the instructions 5 (805), instruction 8 (808), 

20 and instruction 11(811). 

In the foregoing specification embodiments of the invention has been described with 
reference to specific exemplary embodiments thereof. It will, however, be evident that various 
modifications and changes may be made thereto without departing from the broader spirit and 
scope of the embodiments of the invention. The specification and drawings are, accordingly, to 

25 be regarded in an illustrative rather than restrictive sense. 
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